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I.  INTRODUCTION 


One  of  the  main  driving  forces  in  solid-state  electronics  is  the 
thrust  to  develop  smaller  and  faster  active  devices,  which  dissipate 
very  little  power,  for  eventual  use  in  large-scale  integrated  circuits. 
In  most  situations,  however,  the  actual  power-delay  product  of  the 
device  itself  is  not  as  important  as  constraints  of  charging  the  inter¬ 
connection  capacitances  of  the  wiring.  In  this  introduction,  we  show 
that  there  is  a  fundamental  lower  bound  to  the  delay  time  itself.  This 
limit  arises  from  the  necessity  for  heat  removal,  the  necessary  require¬ 
ments  for  signal  propagation,  as  well  as  the  speed-power  product  itself. 
At  the  fundamental  limit,  the  delay  time  is  insensitive  to  modest  varia¬ 
tions  in  the  actual  speed-power  product.  At  the  opposite  extreme,  in 
the  wire-dominated  chip,  the  cell  size  is  determined  by  wiring  con¬ 
straints  rather  than  device  cdnstraints  and  the  delay  time  is  also 


independent  of  the  speed-power  product. 


Power-Delay  Products 


1—  1 


Integrated  circuit  technologies  are  often  compared  to  each  other 
through  the  speed-power  product  of  a  single  gate,  typically  an  inverter, 
and  these  are  measured  through  the  construction  of  analog  circuits  such 
as  ring .  oscillators .  Such’ comparisons'  rely  upon  the  fact,  that  when 
properly  defined  and  measured,  the  speed-power  product  can  be  equated 
with  the  minimum  energy  heeded  to  perform  a  logic  operation.^  Although 
the  power-delay  product  is  typically  viewed  as  being  a  technology  deter¬ 
mined  parameter,  several  authors  have  recently  argued  that  there  are 

fundamental  limits  to  how  small  the  energy  dissipation  (and  therefore 

2 

the  speed-power  product)  can  be  made.  Landauer  calculated  a  thermo¬ 
dynamic  limit  for  switching  of  a  single  particle  in  a  bistable  potential 
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''  3 

well  and  found  a  value  of  kT*ln(2).  Bate  ,  on  the  other  iu.nd, 

considered  the  switching  of  a  group  of  particles  between  two  collision- 

ally  broadened,  quantum  mechanical  energy  levels  and  found  that  the 

•19 

minimum  energy  cannot  be  reduced  below  10  Joule  for  an  error  proba- 
-30 

bility  of  10  .  This  is  not  an  unrealistically  conservative  design 

goal  for  an  error  probability,  although  it  is  considerably  beyond  cur¬ 
rent  technology.  On  the  other  hand,  computer  simulations  of  Josephson 

•  18 

junction  logic  elements  have  yielded  power-delay  products  of  10 
4 

Joule  .  While  the  model  of  Bate  is  abstract,  it  is  hard  to  see  how  the 
additional  complexities  present  in  real  inverter  circuits  would  either 
increase  the  speed  or  decrease  the  power  dissipation.  Indeed,  the  limit 
is  conservative  in  many  respects.  First,  the  possibility  of  gain  is 
neglected.  Gain  is  characteristic  of  all  real  logic  gates  and  requires 
additional  energy  dissipation  for  e.g.,  the  charging  of  interconnecting 
lines.  Secondly,  the  limit  proposed  assumes  rapid  operation  or  short 
delays.  The  cross-over  to  the  thermodynamic  limit,  similar  to  that  of 
Landauer  for  slow  operation  lies  in  a  range  of  10-1000  GHz  clock 
frequency. 

The  power-delay  product  discussed  above  would  actually  be 
associated  with  an  inverter  circuit  composed  of  the  bare  devices  them- 
selves.  However,  a  practical  integrated  circuit  also  has  lines  or  wires 
associated  with  each  gate.  These,  lines  have  "a  power-delay  product 
associated  with  them  and  generally  dominate  the  actual  energy  dissipa¬ 
tion  in  the  logic  gate.^’^  The  extreme  arises  when  the  necessity  for 
wiring  dominates  the  cell  size  and  the  device  is  no  longer  critical  for 
setting  energy  dissipation.  This  latter  case  is  referred  to  as  the 
wire-dominated  chip.^  We  return  to  this  latter  case  below. 
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There  are  other  constraints  that  must  also  be  placed  on  the  power 
dissipation  and  the  delay  time  in  an  integrated  circuit.  To  illustrate 
these,  we  consider  a  square  array  of  N  logic  cells  of  area  A  each.  If 
we  can  remove  Q  Watts  of  heat  per  unit  area,  we  must  require  that  the 
dissipated  power  P  satisfies 

P  <  QA.  (1) 

However,  this  condition  only  establishes  the  fact  that  a  steady-state 
temperature  exists  and  does  not  insure  that  this  temperature  will  be 
low.  Thus,  this  is  a  "worst  case"  limit.  A  second  limit  that  must  be 
invoked  is  that,  in  the  worst  case,  a  signal  must  be  able  to  propagate 
across  the  entire  chip  in  the  delay  time  t^.  This  signal  propagation 
requirement  is  expressed  as 

i  ctd  ,  (2) 

where  c  is  the  speed  of  propagation  and  is  typically  lower  than  the 
speed  of  light  due  to  the  dielectric  properties  of  the  semiconductor 
chip. 

From  the  above  discussion,  we  can  now  define  a  fundamental  limit  on 

the  delay  time  itself.  The  argument  is  similar  to  one  proposed  earlier 

7  8 

•  by  Keyes,  •  but  differs  in  that  a  f  irin  ' limit  on' "t^  is  set'.  ’  This  limit 

apparently  has  not  previously  been  recognized.  Equations  (1)  and  (2) 

imply  a  range'  of-  areas  that  must  satisfy  the  joint  inequalities  ’  ' 

P/Q  <  A  <  c2t2/N  .  (3) 

These  inequalities  illustrate  that  the  requirement  of  signal  propagation 
leads  to  such  a  small  system  for  a  given  delay  time  that  the  heat  can  no 
longer  be  removed.  However,  there  is  another  constraint  set  by  the 


4. 


energy  dissipation,  or  Pt.  =  E  .  Using  this  quantity  in  (3)  leads  to 

a  in 

the  final  limit** 

NE m /Qc2  <  t2  .  (4) 

The  inequality  in  (4)  appears  to  be  a  fundamental  lower  bound  on  the 
delay  time  itself.  It  is  interesting  to  note  that  the  degree  of  inte¬ 
gration  represented  by  N  and  the  heat  removal  rate  Q  are  as  important 
as  the  power-delay  product  of  the  individual  cells  in  determining  how 
fast  the  overall  chip  can  be  operated.  In  particular,  we  note  that  it 

requires  a  three  order-of-magnitude  reduction  in  E  to  reduce  the 

m 

minimum  t^  by  a  single  order-of-magnitude.  Increasing  the  integration 

level  actually  tends  to  worsen  the  minimum  delay  time  that  can  be 

achieved,  although  this  minimum  is  well  below  the  levels  discussed  in 

-19 

today's  technology.  Using  the  above  limit  found  by  Bate  of  10  Joule, 

2 

a  value  of  Q  of  20  W/ cm  (appropriate  to  a  Freon  cooled  computer),  and 
N  =  1,  a  lower  limit  of  0.01  picosecond  is  found.  The  clock  frequency 
corresponding  to  this  is  in  the  optical  range,  yet  seems  to  be  a 
fundamental  limit. 

The  Wire-Dominated  Chip 

In  some  applications,  such  as  gate,  array.. chips,- the  chip  itself  is- 
dominated  by  the  interconnection  wiring  and  is  referred  to  as  a  wire- 
dominated  chip.**  In  these  chips,  the  cell  size  is  determined  not  by  the 
devices  but  by  the  number  of  wires  that  must  run  through  the  cell 
itself.  When  this  limit  occurs,  further  reductions  in  the  active  area 
of  the  devices  themselves  will  not  significantly  increase  the  number  of 
logic  cells  per  unit  area  of  chip.  In  these  situations,  the  actual 
power-delay  product  of  the  device  itself  is  not  as  important  as  con¬ 
straints  of  charging  the  interconnection  capacitances  of  the  wiring. 
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Keyes^  has  argued  that  the  power-delay  product  of  such  a  wire-dominated 
chip  is 

Ptd  =  C^fK2 wVgVg/M  ,  (5) 

where  P  is  the  power  dissipation,  t^  is  the  delay  time,  M  is  the 
number  of  metallization  or  wire  layers  in  the  chip,  K  is  the  number  of 
wire  channels  of  width  w  required  per  logic  cell,  and  f  is  the 
fraction  of  these  channels  occupied  by  wires  with  capacitance  Cw  per 
unit  length  of  wire.  Here,  Vg  and  Vg  are  the  supply  and  signal 
voltages,  respectively.  The  form  of  (5)  is  easily  understood,  as  the 
right-hand  side  is  just  the  energy  stored  in  the  capacitance  of  the 
wires,  if  the  average  length  of  a  wire  is  K/M  and  the  average  number 
of  wires  per  cell  is  w*f*K. 

Since  the  cell  area  is  dominated  by  the  interconnection  wiring,  the 
cell  size  itself  is  approximately  given  by 

A  =  (Kw/M)2  .  (6) 

The  ability  to  dissipate  the  heat  generated  by  the  input  power  P  must 
also  be  included  in  the  discussion.  This  can  be  done  through  the  incor¬ 
poration  of  (1)  and  (5).  The  other  fundamental  limit,  mentioned  above, 
is..£hat. ,a  signal- must  t>e  able  to  propagate. across  the  entire  chip  in  the 
delay  time  t^,  as  discussed  above.  This  leads  to  (2).  Thus,  for  a 
chip  containing  N  square  cells,  this  . signal  propagation  limit  is. 

g 

expressed  by  combining  (1),  (2),  (5),  and  (6)  as 

t2  >  N%fKCwVgVs/Qc  .  (7) 

As  above,  this  constraint  is  essentially  independent  of  geometrical 
factors  such  as  w,  although  these  latter  factors  do  affect  slightly. 
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Using  (7),  a  lower  limit  to  the  delay  time  can  also  be  obtained  by 

taking  the  appropriately  most  favorable  limits  on  the  individual  param- 

9 

eters.  Heller  et  al.  have  shown  that  a  value  of  K  =  20  is  typical 
for  a  system  of  1000  gates.  Master  slices  with  this  number  of  gates 
have  been  reported^  and  do  approach  the  wire-dominated  limit.  Heat 
dissipation  rates  from  above  will  be  used.  The  question  of  line  capaci¬ 
tance  is  more  arguable,  but  for  future  small  chips,  the  fringing  and 
interline  capacitances  will  probably  limit  the  lower  value  of  Cw  to 
0.1  fF/micron.1*  Using  these  values  and  setting  c  to  the  speed  of 
light,  a  lower  limit  of  about  0.2  nonoseconds  is  obtained  for  the  delay 
time. 

The  significance  of  this  result  is  obvious.  Even  if  the  individual 
logic  gates  in  a  wire-dominated  chip  are  fast,  the  chip  itself  cannot 
have  a  delay  time  shorter  than  0.2  nanoseconds,  corresponding  to  a  clock 
frequency  of  about  1.0  GHz.  While  the  constraints  used  here  are  appro¬ 
priate  to  a  master  slice  chip,  the  conclusion  is  expected  to  be  more 
general  and  apply  to  a  wide  variety  of  array  type  logic  designs  in  which 
each  cell  must  be  able  to  communicate  with  each  other  cell  or  with  cache 
memories  at  some  distance. 

Current  Technologies 

‘  Midway  between  the  two  limiting  cases  discussed  above  are  those 
cases  pertaining  to  current  efforts  to  obtain  high  speed  logic.  It  is 
of  interest  to  try  to  set  practical  limiting  values  of  the  expected 
delay  times  for  these  logic  families.  While  no  direct  methods  along  the 
lines  of  the  above  discussions  are  available,  we  can  however  use  the 
above  results  to  infer  some  practical  values.  This  can  be  done  in  the 
following  way.  First,  we  use  (5)  to  determine  the  smallest  value  of  K 
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for  which  the  speed-power  product  of  the  active  devices  is  smaller  than 

the  right-hand  side  of  (5)  for  some  appropriate  technological  param- 

9 

eters.  Using  the  results  of  Heller  et  al . ,  the  maximum  number  of  gates 
which  can  be  integrated  on  the  chip  without  it  becoming  wire-dominated 
(in  a  power-delay  sense)  can  be  then  estimated.  Finally,  these  values 
can  be  used  in  (4)  to  determine  an  appropriate  delay  time.  We  apply 
this  below  to  a  high-speed  GaAs  implementation. 

For  the  case  of  GaAs,  we  can  take  typical  values  as  M  =  2, 
Vg  =  Vg  =  5  V,  f  =  0.5,  and  w  =  1  micron  (increasing  w  will  lead  to  a 
reduction  in  the  resulting  N  for  high-speed  operation).  We  also  can 
take  the  lower  limit  on  to  be  0.1  pF/micron  as  discussed  above. 

For  these  values,  (5)  becomes 

E  .  S  6  x  10-16  K2  .  (8) 

wire 

Experimental  values  of  the  speed-power  product  for  GaAs  MOSFETs  are  on 

12 

the  order  of  0.05  picojoule  for  the  above  parameters,  which  gives  a  K 
of  about  10  as  the  maximum.  The  results  of  Ref.  9  suggests  that  this 
corresponds  to  an  N  of  about  200,  which  yields  a  delay  time  of  about 
6.5  picosecond  from  (4).  The  driving  function  on  this  limit  is  of 
course  the  experimental  energy  dissipation  in  the  logic  gates  them¬ 
selves.  Lowering  this  value  by  for  example  using  HEMT  technology  leads 
to  a  slightly  lower  value  for  the  delay  time. 

In  summary,  we  have  considered  the  limits  imposed  by  heat 
dissipation  and  signal  propagation  velocity  in  integrated  circuit  chips. 
The  two  contradictory  limits  of  large  area  for  power  dissipation  and 
small  area  for  signal  propagation  have  set  a  limiting  constraint  on  the 
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achievable  delay  time  in  real  thermodynamically  irreversible  computers. 
Considerations  of  the  fundamental  energy  dissipation  limits  suggested  by 
quantum  mechanical  calculations  lead  to  a  lower  limit  on  the  delay  time 
of  about  0.01  picoseconds.  On  the  other  hand,  constraints  imposed  on  a 
master  slice  type  chip  are  more  general  and  lead  to  a  limit  on  the 
achievable  delay  time  of  about  0.2  nanoseconds,  corresponding  to  a  clock 
frequency  of  the  order  of  1  GHz  or  less,  due  to  the  domination  by  the 
interconnection  wiring.  These  latter  are  more  general,  n  that  any 
array  type  logic,  in  which  each  cell  must  communicate  wi  each  other 
cell  or  with  large  cache  stores  of  memory  at  some  dista  will  also 
likely  be  wire  dominated  and  show  the  same  limitations.  "  >  implies 
that  designs  for  fast  logic  must  be  prepared  in  such  a  manner  that  the 
cell  area  is  not  dominated  by  the  requirements  of  necessary  wiring 
interconnections  if  the  speed  is  to  be  achieved.  Considerations  of  fast 
logic,  such  as  GaAs  technology,  lead  to  intermediate  delay  times  in  the 
1-10  picosecond  range,  if  wire  dominance  can  be  avoided. 

These  results  are  significant.  In  particular,  it  is  readily 

apparent  that  one  can  expect  delay  times  in  future  logic  circuitry  to 

get  down  to  the  time  scale  of  energy  and  momentum  relaxation  times 

.  themselves,  especially  in  alternate  materials  such  as  GaAs  or  InP. 

Moreover,  the  time  scale  is  on  the  order  of  the  duration  of  overshoot 

13-15 

velocities  themselves,  so.  that  higher  drive  currents  can  be 

expected  from  future  devices.  It  is  thus  imperative  that  we  begin  to 
fully  understand  the  transport  in  the  transient  dynamic  response  (TDR) 
regime  appropriate  to  these  future  VLSI  technologies. 
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II.  SUMMARY  OF  RECENT  RELEVANT  WORK 
In  recent  years,  much  interest  has  centered  upon  the  transient 
dynamic  response  of  electrons  as  it  impacts  carrier  transport  through 
small  spatial  regions  of  high  electric  field.  With  recent  improvements 
in  technological  fabrication  of  very-short-channel  devices,  this  problem 
has  become  not  only  of  theoretical  interest  but  of  practical  interest 
as  well.  For  instance,  in  the  pinch-off  region  of  a  short-gate  field- 
effect  transistor,  the  carriers  injected  at  the  source  move  by  a  combi¬ 
nation  of  drift  and  diffusion  in  a  very  high  electric  field.  Then  the 
transit  time  of  the  carriers  under  the  gate  can  be  shorter  than,  or  of 
the  same  order  of  magnitude  as,  the  time  needed  to  establish  a  steady- 
state  high-field  distribution  function.  In  fact,  a  condition  for  this 
to  occur  is  that  the  transit-time  in  the  high-field  region  be  comparable 
to  the  momentum  relaxation  time  thus  causing  the  velocity  to  increase, 
but  much  shorter  than  the  energy  relaxation  time.  Thus,  on  average  the 
carriers  may  transit  through  a  considerable  portion  of  the  high-field 
region  with  almost  their  low  field  mobiilicy  even  though  the  applied 
field  corresponds  to  the  saturated  velocity  range. ^  This  is  true  not 
only  for  the  first-order  moment  of  the  distribution  function  of  the 
carriers  (drift  velocity),  but  also  is  true  for  higher  order  moments, 
and  especially  for  diffusion  (related  to  the  second  moment).  The  diffu¬ 
sion  coefficient  is  one  of  the  most  important  parameters  required  in 
modeling  semiconductor  devices,  it  is  not  only  necessary  for  evaluating 
operating  characteristics  and  frequency  characteristics  but  it  provides 
also  fundamental  characterization  of  velocity  fluctuations  in  the  system 
and  their  contribution  to  noise  in  the  device.^  Diffusion  actually 
is  a  process  depending  upon  velocity  correlation  and  the  relationship 
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between  diffusion  and  drift,  as  expressed  by  the  Einstein  relation,  is  a 
steady-state  relation.  Indeed,  tractable  results  for  the  steady-state 
hot  electron  problem  have  only  recently  been  achieved.  The  problem  in 
the  transient  region  is  complicated  by  the  fact  that  the  random-walk 
equations  governing  transient  diffusion  do  not  reduce  to  normal  Fick's 
law  behavior  on  time  scales  comparable  to  the  relaxation  process,  a 
result  of  the  general  non-Markovian  and  nonstationary  nature  of  trans¬ 
port  on  these  time  scales.  We  have  addressed  some  of  these  problems 

with  the  help  of  a  Retarded  Langevin  Equation  (RLE)  as  well  as  with 

ensemble  Monte  Carlo  (EMC)  techniques  in  order  to  approach  the  random 

walk  of  the  carriers.  In  the  former  case,  we  define  a  nonstationary 

two-time  correlation  function  for  the  velocity  fluctuations  which  can  be 
related  to  a  transient  diffusion  coefficient.  The  formal  solution  of 
the  RLE  allows  us  to  derive  a  general  expression  for  the  correlation 
function. 

Retarded  Langevin  Equation 

We  consider  an  ensemble  of  carriers  initially  at  equilibrium  with 

the  crystal  lattice.  The  ensemble  is  represented  by  a  Maxwell-Bol tzmann 

2 

distribution  in  v-space  (<v>  =  0  and  <v  >  =  3kgT^/m  with  T^  the 
lattice  temperature).  At  a  certain  time,  referred  as  t  =  0,  we  apply  a 
macroscopically  homogeneous  and  steady  electric  field  whose  amplitude 
corresponds  to  hot  carriers.  These  conditions  give  rise  to  a  TDR  regime 
in  which  the  system  relaxes  toward  a  nonequilibrium,  steady-state  and 
often  exhibits  a  velocity  overshoot. 

The  motion  of  the  particles  is  governed  by  a  Retarded  Langevin 

18  19 

Equation  for  the  velocity  of  the  carriers,  and  is  written  as  * 
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d  V  _  • 

“  dt  =  ""  f  Y(t-u)  v(u)du  +  R(t)  +  qEQh(t)  ,  (9) 

o 

where  R(t)  is  a  random  force  symbolizing  the  random  (nonregular)  part 

of  the  collisions  of  the  carriv .  ;  with  the  lattice  (no  carrier-carrier 

interaction  is  considered  here),  Eq  is  the  external  field,  and  h(t) 

is  the  Heavyside  function.  y(t)  is  the  memory  function  of  the  system 

and  is  related  to  the  correlation  function  of  the  total  force  applied  to 

the  system.  For  instance,  in  the  case  of  an  equilibrium  regime,  we 

2  2 

would  have  y(t)  =  <R(0)R(t)>/m  <v  (0)>,  in  absence  of  external  forces. 

Equation  (9)  is  a  non-Markovian  form  of  the  Langevin  equation, 
since  the  rate  of  change  of  the  velocity  at  t  not  only  depends  on  the 
velocity  at  that  time  but  also  depends  on  all  past  time.  Further,  it  is 
a  nonstationary  equation,  since  the  lower  bound  in  the  intergral  refers 
to  that  time  where  tho  disturbing  field  was  applied.  It  is  generally 
admitted  by  now  that  oniy  an  equation  such  as  (9)  can  describe  very-fast 
processes,  and  in  particular  the  TDR  regime  can  be  described  in  this 
way.  This  equation  can  be  readily  manipulated  to  lead  to  the  general¬ 
ized  Kubo  form^° 

qE  t 

vd(t)  =  - J  ♦  (0,t’)dt’  (10) 

m<v*(0)>  o  Av 

This  relationship  existing  between  the  first  and  second  moment  of  the. 
velocities  of  the  carriers  is  a  direct  intrinsic  property  of  the  RLE 
used  to  describe  the  TDR  regime,  and  is  a  statement  of  the  familiar  Kubo 
formula  found  in  equilibrium  statistical  mechanics. 

Ensemble  Monte  Carlo  Technique 

The  ensemble  Monte  Carlo  used  here  has  been  developed  by  Lebwohl 
21 

and  Price  and  subsequently  used  and  described  in  great  detail  by  the 
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principal  investigator.  An  ensemble  of  >5000  electrons,  initially  in 
equilibrium,  is  subjected  to  a  strong  electric  field  step  at  t  =  0,  and 
the  transient  response  is  studied.  This  procedure  is  applied  to  n-Si, 
biased  along  the  < 1 1 1 >  directions.  Nonparabol icity  is  included  with 
a  =  0.5  eV_1. 

Both  sides  of  (10)  have  been  calculated  in  the  simulation  and  are 
compared  in  Fig.  1.  The  agreement  is  within  the  accuracy  of  the  EMC 
method  and  leads  to  the  conclusion  that  the  RLE  can  correctly  describe 
the  motion  of  hot  carriers  in  the  TDR  regime.  From  (10)  we  see  that  a 
linear  increase  of  the  velocity  (<v(t)>  =  eFt/m*),  as  expected  for  bal¬ 
listic  transport,  can  only  occur  when  is  constant  in  time.  In 
Fig.  2,  we  show  the  temporal  evolution  of  <J> (t  >  tQ)  for  three 
different  values  of  tQ ,  including  t  =  0.  It  is  evident  from  this 
figure  that  the  time  duration  over  which  <J)^v  is  constant,  correspond¬ 
ing  to  a  ballastic  rise  of  the  velocity,  is  exceedingly  short,  perhaps 
only  a  few  femtoseconds,  even  though  the  mean  free  time  is  much  longer. 
We  also  note  that  the  MC  results  show  a  general  trend  in  that  <J>^v(t ,  tQ) 
decays  more  rapidly  for  longer  t  ,  corresponding  to  a  higher  tempera¬ 
ture  and  more  rapid  randomization  of  the  ensemble  as  time  increases. 


0  01  02  03  04 

t<ps) 

Figure  1.  Comparison  of  the  ensemble  drift  velocity  (A)  and  integral 
of  the  velocity  autocorrelation  function  (•)  as  a  function 
of  time. 
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Figure  2.  Normalized  velocity  autocorrelation  function  (t,t©) 

for  three  initial  times  t  . 

o 

» 

Another  problem  in  the  usual  treatment  of  ballistic  transport  is 

related  to  the  energy  balance  equation.  Normally,  the  contribution  of 

the  thermal  energy,  due  to  the  random  motion  of  carriers,  is  completely 

neglected.  But,  this  can  be  shown  to  be  the  dominant  term.  A  qualita- 
23 

tive  explanation  of  the  processes  on  very  short-time  scales  is  the 

following:  when  an  electric  field  is  turned  on  at  t  =  0,  the  ensemble 

of  electrons  begins  to  respond  instantaneously  by  a  shift  in  momentum 

space,  causing  an  instantaneous  rise  of  the  velocity.  However,  we  don't 

have  an  instantaneous  increase  of  velocity  fluctuations,  that  is  a 

spreading  of  the  ensemble  in  momentum  space.  Only  when  the  collisions 

begin  to  break  up  the  correlation  of  the  ensemble  with  the  initial  state 

jean  the  energy  begin  to  evolve.  The  memory  effect  in.  the  evolutioa  of 

the  energy  is  not  taken  into  account  in  the  equations  normally  used  for 

ballistic  transport.  A  first  attempt  to  express  this  in  a  formal  way 

24 

leads  to  the  following  equation 

t 

g  =  qFvd(t)ll  -  *Av(t,0)]  -  /  dt’[E(t-t')  -  EoJxe(f)  , 


(11) 
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where  Xfe(t)  is  a  decay  function  related  to  the  energy  correlation 
function.  In  Fig.  3  we  plot  the  initial  increase  of  the  energy  as 
calculated  by  the  EMC.  For  comparison,  we  also  show  a  curve  corre¬ 
sponding  to  the  forcing  term  in  (11)  (continuous  curve)  and  to  a 
ballistic  regime  (dashed  curve).  It  is  evident  that  at  very  short 
times,  the  role  of  the  memory  function  is  significant,  and  that  colli¬ 
sions  effects  are  already  evident  in  the  EMC  data  for  t  >  0.04  psec. 
Equation  (11)  was  derived  under  the  assumption  that  the  energy  associ¬ 
ated  with  the  drift  of  the  ensemble  is  much  smaller  than  the  thermal 
energy.  Unfortunately,  this  is  not  the  case  for  Si  for  times  shorter 

than  0.1  ps,  as  shown  in  Fig.  4  where  the  two  energies  are  calculated 

2 

from  the  EMC  for  a  field  of  50  kV/cm  (E.  =  m*v,/2).  Thus,  in  order  to 

ad 

formally  solve  the  problem  of  TDR,  we  will  have  to  go  back  to  the  origi¬ 
nal  equations  and  try  to  obtain  decoupled  equations  for  energy  and 
momentum . 

It  has  been  shown  that  a  time  dependent  diffusion  coefficient  can 
be  defined  in  the  nonstationary  regime  as 

D(t)  =  Hi <[Ax(t)]2>  =  ^  ^v(t,'t)dt'  •  (i2) 

o 


Figure  3.  Response  of  the  average  energy  (in  degrees  K)  of  the 
ensemble.  The  data  point  are  EMC  results,  the  continu¬ 
ous  curve  represent  the  forcing  term  of  Equation  (11) 
and  the  dashed  curve  represents  the  ballistic  rise. 
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Figure  4.  Evolution  of  the  drift  (T^)  and  thermal  (T  )  energy  of 
the  ensemble  (left  scale)  and  the  ratio  of  the  two  (right 
scale)  as  a  function  of  the  time. 

2 

Using  the  EMC  method,  $^v(t*  ,t)  and  <(Ax(t))  >  can  be  calculated 
independently  and  the  validity  of  (12)  can  be  checked.  In  the  present 
work,  the  integral  is  calculated  by  computing  25  correlation  functions 
at  equally  spaced  times  in  the  range  0-0.4  psec.  This  result  is 
illustrated  in  Fig.  5,  where  the  two  sides  of  (12)  have  been  calculated 
for  an  electric  field  of  50  kV/cm.  The  agreement  is  good,  within  the 
error  due  to  the  numerical  integration  and  differentiation  performed. 


Time  ( pi ) 

Figure  5.  Comparison  of  the  diffusion  coefficient  as  a  function  of 
time,  obtained  from  the  correlation  function  (x)  and  from 
the  time  derivative  of  the  average  square  displacement 
(•)• 
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Since  the  steady-state  diffusion  coefficient  can  be  obtained  as  a 
limiting  case  of  (12),  this  expression  can  be  assumed  to  be  a  general 
definition  of  the  diffusion  coefficient  for  both  stationary  and  non¬ 
stationary  regimes.  The  time  evolution  of  D(t),  as  obtained  from  EMC 
methods  and  from  the  RLE  approach,  is  shown  in  Fig.  6,  for  two  different 
electric  fields.  The  most  important  feature  is  that  the  diffusion 
coefficient  exhibits  an  overshoot.  This  is  related  to  the  oscillatory 
shape  of  the  correlation  function  <j>^v(t',t),  which  arises  from  a  com¬ 
bination  of  momentum  and  energy  relaxation.  This  also  leads  to  a  veloc¬ 
ity  overshoot  affect.  This  is  present  both  in  the  theoretical  and  MC 
results,  which  are  found  to  be  in  fairly  good  qualitative  agreement. 


Figure  6.  Diffusion  coefficient  as  a  function  of  time  for  two 
electric  fields  as  obtained  from  EMC  results  (dashed 
curves)  and  RLE  results  (solid  curves). 
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